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AMENDMENTS TO THE CLAIMS 

Please amend claims 1, 14, 27, 39, 50 and 61 as follows. 
Please add new claim 71 . 

Listing of Claims: 

1 . (Currently amended) A computer-implemented method for minimizing 
effects of outlier data on data modeling comprising: 

selecting a modeling parameter from a plurality of modeling parameters 
characterizing a mixture of Student distribution components; 

computing a tractable approximation of a posterior distribution for the selected 
modeling parameter based on an input set of data and a current estimate of a posterior 
distribution of at least one unselectcd modeling parameter in the plurality of modeling 
parameters; 

computing a lower bound of a log marginal likelihood as a function of current 
estimates of the posterior distributions of the modeling parameters, the current estimates 
of the posterior distributions of the modeling parameters including the computed tractable 
approximation of the posterior distribution of the selected modeling parameter; 

determining if the lower bound has been satisfactorily optimized, wherein the 
lower bound is satisfactorily optimized when the computed lower bound has changed less 
than a threshold amount from a previously computed lower bound; 

generating a probability density modeling the input set of data, the probability 
density including the mixture of Student distribution components, the mixture of Student 
distribution components being characterized by the current estimates of the posterior 
distributions of the modeling parameters, if when the lower bound is satisfactorily 
optimized; and 

outputting the probability density. 
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2. (Original) The method of claim 1 wherein the computing operations comprise 
a first iteration and further comprising: 

selecting a different modeling parameter from the plurality of modeling 
parameters and repeating in a subsequent iteration the operations of computing a tractable 
approximation and computing a lower bound using the newly selected modeling 
parameter, if the lower bound is not satisfactorily optimized in the first iteration. 

3. (Original) The method of claim 1 wherein computing a lower bound 
comprises: 

computing the lower bound of the log marginal likelihood as a function of prior 
distributions of the modeling parameters. 

4. (Original) The method of claim 1 wherein computing a tractable 
approximation of a posterior distribution comprises: 

computing a variational approximation of the posterior distribution of the selected 
modeling parameter. 

5. (Original) The method of claim 1 wherein one of the plurality of modeling 
parameters represents a mean of each of the Student distribution components. 

6. (Original) The method of claim 1 wherein one of the plurality of modeling 
parameters represents a precision matrix of the Student distribution components. 

7. (Cancelled). 

8. (Original) The method of claim 1 wherein one of the plurality of modeling 
parameters represents a scaling parameter of a precision matrix of the Student distribution 
components. 

9. (Original) The method of claim 1 wherein one of the plurality of modeling 
parameters represents a mixing coefficients parameter of the Student distribution 
components. 
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10. (Original) The method of claim 1 wherein generating a probability density 
comprises: 

generating the probability density including the mixture of Student distribution 
components, the mixture of Student distribution components being characterized by the 
current estimates of the posterior distributions of the modeling parameters and an 
estimate of the number of degrees of freedom of each Student distribution component. 

1 1 . (Original) The method of claim 1 further comprising: 

storing the current estimates of the posterior distributions of the modeling 
parameters in a storage location. 

12. (Previously presented) The method of claim 1 wherein the input set of data 
represents auditory speech data from an unknown number of speakers. 

13. (Previously presented) The method of claim 1 wherein the input set of data 
represents image segmentation data from images. 

14. (Currently amended) A computer program product encoding a computer 
program for executing on a computer system a computer process for minimizing effects 
of outlier data on data modeling , the computer process comprising: 

selecting a modeling parameter from a plurality of modeling parameters 
characterizing a mixture of Student distribution components; 

computing a tractable approximation of a posterior distribution for the selected 
modeling parameter based on an input set of data and a current estimate of a posterior 
distribution of at least one unselected modeling parameter in the plurality of modeling 
parameters; 

computing a lower bound of a log marginal likelihood as a function of current 
estimates of the posterior distributions of the modeling parameters, the current estimates 
of the posterior distributions of the modeling parameters including the computed tractable 
approximation of the posterior distribution of the selected modeling parameter; 
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determining if the lower bound has been satisfactorily optimized, wherein the 
lower bound is satisfactorily optimized when the computed lower bound has changed less 
than a threshold amount from a previously computed lower bound; 

generating a probability density modeling the input set of data, the probability 
density including the mixture of Student distribution components , the mixture of Student 
distribution components being characterized by the current estimates of the posterior 
distributions of the modeling parameters, if when the lower bound is satisfactorily 
optimized; and 

outputting the probability density. 

15. (Original) The computer program product of claim 14 wherein the computing 
operations comprise a first iteration and further comprising: 

selecting a different modeling parameter from the plurality of modeling 
parameters and repeating in a subsequent iteration the operations of computing a tractable 
approximation and computing a lower bound using the newly selected modeling 
parameter, if the lower bound is not satisfactorily optimized in the first iteration. 

16. (Original) The computer program product of claim 14 wherein computing a 
lower bound comprises: 

computing the lower bound of the log marginal likelihood as a function of prior 
distributions of the modeling parameters. 

17. (Original) The computer program product of claim 14 wherein computing a 
tractable approximation of a posterior distribution comprises: 

computing a variational approximation of the posterior distribution of the selected 
modeling parameter. 

18. (Original) The computer program product of claim 14 wherein one of the 
plurality of modeling parameters represents a mean of each of the Student distribution 
components. 
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19. (Original) The computer program product of claim 14 wherein one of the 
plurality of modeling parameters represents a precision matrix of the Student distribution 
components. 

20. (Cancelled). 

2 1 . (Original) The computer program product of claim 14 wherein one of the 
plurality of modeling parameters represents a scaling parameter of a precision matrix of 
the Student distribution components. 

22. (Original) The computer program product of claim 14 wherein one of the 
plurality of modeling parameters represents a mixing coefficients parameter of the 
Student distribution components. 

23. (Original) The computer program product of claim 14 wherein generating a 
probability density comprises: 

generating the probability density including the mixture of Student distribution 
components, the mixture of Student distribution components being characterized by the 
current estimates of the posterior distributions of the modeling parameters and an 
estimate of the degrees of freedom of each Student distribution component. 

24. (Original) The computer program product of claim 14 wherein the computer 
process further comprises: 

storing the current estimates of the posterior distributions of the modeling 
parameters in a storage location. 

25. (Previously presented) The computer program product of claim 14 wherein 
the input set of data represents auditory speech data from an unknown number of 
speakers. 

26. (Previously presented) The computer program product of claim 14 wherein 
the input set of data represents image segmentation data from images. 
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27. (Currently amended) A system for minimizing effects of outlier data on 
data modeling comprising: 

a modeling parameter selector selecting a modeling parameter from a plurality of 
modeling parameters characterizing a mixture of Student distribution components; 

a tractable approximation module computing a tractable approximation of a 
posterior distribution for the selected modeling parameter based on an input set of data 
and a current estimate of a posterior distribution of at least one unselected modeling 
parameter in the plurality of modeling parameters; 

a lower bound optimizer module computing a lower bound of a log marginal 
likelihood as a function of current estimates of the posterior distributions of the modeling 
parameters, the current estimates of the posterior distributions of the modeling parameters 
including the computed tractable approximation of the posterior distribution of the 
selected modeling parameter, and determining if the lower bound has been satisfactorily 
optimized, wherein the lower bound is satisfactorily optimized when the computed lower 
bound has changed less than a threshold amount from a previously computed lower 
bound; 

a data model generator generating a probability density modeling the input set of 
data, the probability density including the mixture of Student distribution components, 
the mixture of Student distribution components being characterized by the current 
estimates of the posterior distributions of the modeling parameters, if when the lower 
bound is satisfactorily optimized; and 

an output device outputting the probability density. 

28. (Original) The system of claim 27 wherein the lower bound optimizer 
computes the lower bound of the log marginal likelihood as a function of prior 
distributions of the modeling parameters. 

29. (Original) The system of claim 27 wherein the tractable approximation 
module computes a variational approximation of the posterior distribution of the selected 
modeling parameter. 

RCE Submission -Reply to Final OA mailed June 25, 2007 
Application Number: 10/724,586 
Attorney Docket Number: 305414.01 

7/26 


PATENT 


30. (Original) The system of claim 27 wherein one of the plurality of modeling 
parameters represents a mean of each of the Student distribution components. 

3 1 . (Original) The system of claim 27 wherein one of the plurality of modeling 
parameters represents a precision matrix of the Student distribution components. 

32. (Cancelled). 

33. (Original) The system of claim 27 wherein one of the plurality of modeling 
parameters represents a scaling parameter of a precision matrix of the Student distribution 
components. 

34. (Original) The system of claim 27 wherein one of the plurality of modeling 
parameters represents a mixing coefficients parameter of the Student distribution 
components. 

35. (Original) The system of claim 27 wherein the data model generator 
generates the probability density including the mixture of Student distribution 
components, the mixture of Student distribution components being characterized by the 
current estimates of the posterior distributions of the modeling parameters and an 
estimate of the degrees of freedom of each Student distribution component. 

36. (Original) The system of claim 27 further comprising: 

a memory storing the current estimates of the posterior distributions of the modeling 
parameters. 

37. (Previously presented) The system of claim 27 wherein the input set of data 
represents auditory speech data from an unknown number of speakers. 

38. (Previously presented) The system of claim 27 wherein the input set of data 
represents image segmentation data from images. 
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39. (Currently amended) A computer-implemented method for minimizing 
effects of outlier data on data modeling comprising: 

computing a tractable approximation of a posterior distribution for a selected 
modeling parameter of a plurality of modeling parameters characterizing a mixture of 
Student distribution components based on an input set of data and a current estimate of a 
posterior distribution of at least one unselected modeling parameter in the plurality of 
modeling parameters; 

determining whether current estimates of the posterior distributions of the 
modeling parameters are satisfactorily optimized in relation to a predetermined criterion, 
the current estimates of the posterior distributions of the modeling parameters including 
the computed tractable approximation of the posterior distribution of the selected 
modeling parameter; 

modeling the input set of data by the mixture of Student distribution components, 
the mixture of Student distribution components being characterized by the current 
estimates of the posterior distributions of the modeling parameters; and 

outputting the modeling of the input set of data. 

40. (Original) The method of claim 39 wherein the computing operation and 
determining operation comprise a first iteration and further comprising: 

selecting a different modeling parameter from the plurality of modeling 
parameters and repeating in a subsequent iteration the operations of computing a tractable 
approximation and computing a lower bound using the newly selected modeling 
parameter, if the lower bound is not satisfactorily optimized in the first iteration. 

4 1 . (Previously presented) The method of claim 39 wherein the operation of 
determining whether current estimates of the posterior distributions of the modeling 
parameters are satisfactorily optimized comprises: 

computing a lower bound of the log marginal likelihood as a function of prior 
distributions of the modeling parameters and a variational posterior distribution; and 

determining whether the lower bound satisfies the predetermined criterion of the 
selected modeling parameter. 
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42. (Original) The method of claim 39 wherein computing a tractable 
approximation of a posterior distribution comprises: 

computing a variational approximation of the posterior distribution. 

43. (Original) The method of claim 39 wherein one of the plurality of modeling 
parameters represents a mean of each of the Student distribution components. 

44. (Original) The method of claim 39 wherein one of the plurality of modeling 
parameters represents a precision matrix of the Student distribution components. 

45. (Cancelled). 

46. (Original) The method of claim 39 wherein one of the plurality of modeling 
parameters represents a scaling parameter of a precision matrix of the Student distribution 
components. 

47. (Original) The method of claim 39 wherein one of the plurality of modeling 
parameters represents a mixing coefficients parameter of the Student distribution 
components. 

48. (Original) The method of claim 39 wherein modeling the input data 
comprises: 

generating the probability density including the mixture of Student distribution 
components, the mixture of Student distribution components being characterized by the 
current estimates of the posterior distributions of the modeling parameters and an 
estimate of the degrees of freedom of each Student distribution component. 

49. (Original) The method of claim 39 further comprising: 

storing the current estimates of the posterior distributions of the modeling 
parameters in a storage location. 
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50. (Currently amended) A computer program product encoding a computer 
program for executing on a computer system a computer process for minimizing effects 
of outlier data on data modeling , the computer process comprising: 

computing a tractable approximation of a posterior distribution for a selected 
modeling parameter of a plurality of modeling parameters characterizing a mixture of 
Student distribution components based on an input set of data and a current estimate of a 
posterior distribution of at least one unselected modeling parameter in the plurality of 
modeling parameters; 

determining whether current estimates of the posterior distributions of the 
modeling parameters are satisfactorily optimized in relation to a predetermined criterion, 
the current estimates of the posterior distributions of the modeling parameters including 
the computed tractable approximation of the posterior distribution of the selected 
modeling parameter; 

modeling the input set of data by the mixture of Student distribution components, 
the mixture of Student distribution components being characterized by the current 
estimates of the posterior distributions of the modeling parameters; and 

outputting the modeling of the input set of data. 

5 1 . (Original) The computer program product of claim 50 wherein the computing 
operation and determining operation comprise a first iteration and further comprising: 

selecting a different modeling parameter from the plurality of modeling 
parameters and repeating in a subsequent iteration the operations of computing a tractable 
approximation and computing a lower bound using the newly selected modeling 
parameter, if the lower bound is not satisfactorily optimized in the first iteration. 

52. (Previously presented) The computer program product of claim 50 wherein 
the operation of determining whether current estimates of the posterior distributions of 
the modeling parameters are satisfactorily optimized comprises: 

computing a lower bound of the log marginal likelihood as a function of prior 
distributions of the modeling parameters and a variational posterior distribution; and 
determining whether the lower bound satisfies the predetermined criterion. 
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53. (Original) The computer program product of claim 50 wherein computing a 
tractable approximation of a posterior distribution comprises: 

computing a variational approximation of the posterior distribution of the selected 
modeling parameter. 

54. (Original) The computer program product of claim 50 wherein one of the 
plurality of modeling parameters represents a mean of each of the Student distribution 
components. 

55. (Original) The computer program product of claim 50 wherein one of the 
plurality of modeling parameters represents a precision matrix of the Student distribution 
components. 

56. (Cancelled). 

57. (Original) The computer program product of claim 50 wherein one of the 
plurality of modeling parameters represents a scaling parameter of a precision matrix of 
the Student distribution components. 

58. (Original) The computer program product of claim 50 wherein one of the 
plurality of modeling parameters represents a mixing coefficients parameter of the 
Student distribution components. 

59. (Original) The computer program product of claim 50 wherein modeling the 
input data comprises: 

generating the probability density including the mixture of Student distribution 
components, the mixture of Student distribution components being characterized by the 
current estimates of the posterior distributions of the modeling parameters and an 
estimate of the degrees of freedom of each Student distribution component. 
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60. (Original) The computer program product of claim 50 wherein the computer 
process further comprises: 

storing the current estimates of the posterior distributions of the modeling 
parameters in a storage location. 

6 1 . (Currently amended) A system for minimizing effects of outlier data on 
data modeling comprising: 

a tractable approximation module computing a tractable approximation of a 
posterior distribution for a selected modeling parameter of a plurality of modeling 
parameters characterizing a mixture of Student distribution components based on an input 
set of data and a current estimate of a posterior distribution of at least one unselected 
modeling parameter in the plurality of modeling parameters; 

an optimizer module determining whether current estimates of the posterior 
distributions of the modeling parameters are satisfactorily optimized in relation to a 
predetermined criterion, the current estimates of the posterior distributions of the 
modeling parameters including the computed tractable approximation of the posterior 
distribution of the selected modeling parameter; 

a data model generator modeling the input set of data by the mixture of Student 
distribution components, the mixture of Student distribution components being 
characterized by the current estimates of the posterior distributions of the modeling 
parameters; and 

an output device outputting the modeling of the input set of data. 

62. (Previously presented) The system of claim 61 wherein optimizer module 
computes a lower bound of the log marginal likelihood as a function of prior distributions 
of the modeling parameters and a variational posterior distribution, and determines 
whether the lower bound satisfies the predetermined criterion. 

63. (Original) The system of claim 61 wherein the tractable approximation 
modules computes a variational approximation of the posterior distribution of the 
selected modeling parameter. 
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64. (Original) The system of claim 61 wherein one of the plurality of modeling 
parameters represents a mean of each of the Student distribution components. 

65. (Original) The system of claim 61 wherein one of the plurality of modeling 
parameters represents a precision matrix of the Student distribution components. 

66. (Cancelled). 

67. (Original) The system of claim 61 wherein one of the plurality of modeling 
parameters represents a scaling parameter of a precision matrix of the Student distribution 
components. 

68. (Original) The system of claim 61 wherein one of the plurality of modeling 
parameters represents a mixing coefficients parameter of the Student distribution 
components. 

69. (Original) The system of claim 61 wherein modeling the input data 
comprises: 

generating the probability density including the mixture of Student distribution 
components, the mixture of Student distribution components being characterized by the 
current estimates of the posterior distributions of the modeling parameters and an 
estimate of the degrees of freedom of each Student distribution component. 

70. (Original) The system of claim 61 further comprising: 

a memory storing the current estimates of the posterior distributions of the 
modeling parameters. 

71. (New) The method of claim 1 wherein the input set of data includes only 
observed data. 
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